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AMENDMENTS TO THE CLAIMS: 

1-48 (Cancelled) 

49. (Currently Amended) A machine implemented method for providing statistics 
characterizing content for language translation, comprising the steps of: 

accessing content in a first language, including content retrieved by following a link 
contain e d in crawling a web site via following links to additional pages cont e nt or cont e nt having 
markup tags ; 

parsing the content into one or more translatable components; and 
generating statistics based on the one or more translatable components. 

50. (Previously presented) The method according to claim 49, further comprising using 
the statistics to size the content for language translation. 

51. (Previously presented) The method according to claim 50, wherein the language 
translation includes human translating the one or more translatable components. 

52. (Previously presented) The method according to claim 49, further comprising the 
step of adding the one or more translatable components to a translation list for translation into a 
second language. 

53. (Previously presented) The method according to claim 52, wherein: 

the first language includes one of English, French, Spanish, German, Portuguese, Italian, 
Chinese, Korean, and Arabic; 

the second language includes one of English, French, Spanish, German, Portuguese, 
Italian, Japanese, Chinese, Korean, and Arabic; and 

the second language is different from the first language. 
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54. (Previously presented) The method according to claim 49, wherein the step of 
accessing includes retrieving the content in the first language from a source in a first language. 

55. (Previously presented) The method according to claim 49, wherein each of the 
translatable components is one of: 

a text segment; 
an image file; 
an audio clip; 
a video clip; 
a file; and 

any combination thereof in an electronic data stream. 

56. (Previously presented) The method according to claim 49, further comprising the 
step of generating an identifier for each of the translatable components so that each of the 
translatable components is accessible via a corresponding identifier. 

57. (Previously presented) The method according to claim 56, wherein the identifier for a 
text segment is generated using at least one of a hash code, a checksum, and a mathematical 
algorithm based on one or more text segments. 

58. (Previously presented) The method according to claim 49, wherein the statistics 
includes at least one of a file count, a page count, a text segment count, a unique text segment 
count, a word count, and a unique word count. 
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59. (Previously presented) The method according to claim 49, wherein the step of 
parsing is performed based on at least one markup tag contained in the content in the first 
language. 

60. (Currently Amended) A machine implemented method for providing statistics 
characterizing content, comprising the steps of: 

accessing content including content retrieved by following a link contain e d in crawling a 
web site via following links to additional pages content or cont e nt having markup tags ; 
parsing the content into one or more components; and 
generating statistics based on the one or more components. 

61. (Currently Amended) A system for providing statistics characterizing content for 
language translation, comprising: 

a content accessing unit configured for accessing content in a first language, including 
content retrieved by following a link contain e d in crawling a web site via following links to 
additional pages cont e nt or cont e nt having markup tags : 

an information processing unit configured for parsing the content into one or more 
translatable components; and 

a statistics generation unit configured for generating statistics based on the one or more 
translatable components. 

62. (Previously Presented) The system according to claim 61, further comprising a 
content sizing unit configured for using the statistics to size the content for language translation. 

63. (Previously presented) The system according to claim 62, wherein the language 
translation includes human translating the one or more translatable components. 
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64. (Previously Presented) The system according to claim 61, wherein the content 
accessing unit is further configured for facilitating retrieval of content from a data source in the 
first language. 

65. (Previously presented) The system according to claim 61, wherein each of the 
translatable components is one of: 

a text segment; 
an image file; 
an audio clip; 
a video clip; 
a file; and 

any combination thereof in an electronic data stream. 

66. (Previously presented) The system according to claim 61, wherein the statistics 
includes at least one of a file count, a page count, a text segment count, a unique text segment 
count, a word count, and a unique word count. 

67. (Previously presented) The system according to claim 61, wherein the first language 
includes one of English, French, Spanish, German, Portuguese, Italian, Chinese, Korean, and 
Arabic. 

68. (Previously Presented) The system according to claim 61, wherein the information 
processing unit parses the contentbased on at least one markup tag contained in the content. 

69. (Previously presented) The system according to claim 61, further comprising a 
translation list to which the one or more components are added for translation. 
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70. (Currently Amended) A machine readable medium having data stored thereon, the 
data, when read, causing the machine to perform the following: 

accessing content in a first language, including content retrieved by following a link 
contain e d in crawling a web site via following links to additional pages cont e nt or content having 
markup tags ; 

parsing the content into one or more translatable components; and 
generating statistics based on the one or more translatable components. 

71. (Previously presented) The medium according to claim 70, the data, when read, 
further causing the machine to use the statistics to size the content for language translation. 

72. (Previously presented) The medium according to claim 71, wherein the language 
translation includes human translating the one or more translatable components. 

73. (Previously presented) The medium according to claim 70, wherein the statistics 
includes at least one of a file count, a page count, a text segment count, a unique text segment 
count, a word count, and a unique word count. 

74. (Currently Amended) A machine implemented method for providing statistics 
characterizing content for language translation, comprising the steps of: 

accessing content in a first language, including content retrieved by following a link 
contain e d in crawling a web site via following links to additional pages cont e nt or cont e nt having 
markup tags ; 

parsing the content into one or more translatable components based on markup tags 
contained therein; and 



6 



Application No.: 10/784,334 

generating statistics based on the one or more translatable components, wherein the 
translatable components are to be human translated. 

75. (Previously presented) The method according to claim 74, further comprising the 
steps of: 

obtaining tracking information corresponding to a source from where the content is 
retrieved; and 

utilizing the tracking information when additional content is to be retrieved from the 

source. 

76. (Previously presented) The method according to claim 75, wherein the tracking 
information includes at least one of state information or session information. 

77. (Previously presented) The method according to claim 76, wherein the tracking 
information is obtained via a cookie. 

78. (Previously Presented) The method according to claim 74, wherein when the content 
contains an electronic form, the method further comprising the steps of: 

populating the electronic form automatically with pre-defined information; and 
submitting the populated electronic form to the source. 

79. (Previously presented) The method according to claim 74, wherein the source from 
where the content is retrieved is a web site. 

80. (Previously presented) The method according to claim 74, further comprising: 
identifying an image file associated with the content in the first language as a translatable 

component; 
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generating statistics for the image file. 

81. (Previously presented) The method according to claim 80, wherein the statistics for 
the image file is generated manually. 

82. (Previously presented) The method according to claim 80, wherein the statistics for 
the image file is generated automatically. 

83. (Currently Amended) A machine implemented method for providing statistics 
characterizing content for language translation, comprising the steps of: 

accessing content in a first language, including at l e ast on e of w e b cont e nt, cont e nt 
having markup tags th e r e in, and content retrieved by following a link contain e d in crawling a 
web site via following links to additional pages content or content having markup tags ; 
parsing the content into one or more translatable components; and 
generating statistics based on the one or more translatable components, wherein the 
statistics are used to measure the size the content for language translation. 

84. (Currently Amended) A system for providing statistics characterizing content for 
language translation, comprising: 

a content accessing unit configured for accessing content in a first language, including -at 
lea s t on e of w e b content, cont e nt having markup tags th e r e in, and content retrieved by following 
a link contain e d in crawling a web site via following links to additional pages cont e nt or cont e nt 
having markup tags ;; 

an information processing unit configured for parsing the content into one or more 
translatable components; and 
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a statistics generation unit configured for generating statistics based on the one or more 
translatable components, wherein the statistics are used to measure the size the content for 
language translation. 

85. (Currently Amended) A machine readable medium having data stored thereon, the 
data, when read, causing the machine to perform the following: 

accessing content in a first language, including at l e ast on e of w e b cont e nt, cont e nt 
having markup tags therein, and content retrieved by following a link contain e d in crawling a 
web site via following links to additional pages cont e nt or cont e nt having markup tags ; 

parsing the content into one or more translatable components; and 

generating statistics based on the one or more translatable components, wherein the 
statistics are used to measure the size the content for language translation. 
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